Coal-Miner: a coalescent-based method for GWA studies of quantitative traits with complex evolutionary origins
Authors
Abstract
Association mapping (AM) methods are used in genome-wide association (GWA) studies to test for statistically significant associations between genotypic and phenotypic data. The genotypic and phenotypic data share common evolutionary origins (namely, the evolutionary history of the sampled organisms), introducing covariance that must be distinguished from the covariance due to biological function that is of primary interest in GWA studies. A variety of methods have been introduced to perform AM while accounting for sample relatedness. However, the state of the art predominantly relies on the simplifying assumption that sample relatedness is effectively fixed across the genome. In contrast, population genetic theory and empirical studies have shown that sample relatedness can vary greatly across different loci within a genome; this phenomenon, referred to as local genealogical variation, is commonly encountered in genomic datasets. New AM methods are needed to better account for local variation in sample relatedness within genomes. We address this gap by introducing Coal-Miner, a new statistical AM method. The Coal-Miner algorithm takes the form of a methodological pipeline. The initial stages of Coal-Miner seek to detect candidate loci, that is, loci that contain putatively causal markers. Subsequent stages of Coal-Miner perform tests for association using a linear mixed model with multiple effects that account for sample relatedness locally within candidate loci and globally across the entire genome. Using synthetic and empirical datasets, we compare the statistical power and type I error control of Coal-Miner against state-of-the-art AM methods. The simulation conditions reflect a variety of ...
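To make the idea of a linear mixed model with both a genome-wide and a locus-specific relatedness effect more concrete, here is a minimal Python sketch. It is not the Coal-Miner implementation: the function names (`kinship`, `gls_marker_effect`), the marker-based relatedness estimate, the choice of columns 20 to 39 as the "candidate locus", and the fixed variance components are all assumptions made for illustration. A real analysis would estimate the variance components (for example by REML) rather than fix them.

```python
import numpy as np

def kinship(genotypes):
    """Relatedness matrix from an (n_samples, n_markers) genotype matrix.

    Assumption: a simple centered cross-product, used here only as a
    stand-in for whatever relatedness estimate a real pipeline would use.
    """
    centered = genotypes - genotypes.mean(axis=0)
    return centered @ centered.T / genotypes.shape[1]

def gls_marker_effect(y, marker, k_global, k_local,
                      var_global, var_local, var_resid):
    """Generalized least squares estimate of (intercept, marker effect).

    The phenotype covariance is modeled as
        V = var_global * K_global + var_local * K_local + var_resid * I,
    i.e. one random effect for genome-wide relatedness and one for
    relatedness within the candidate locus. The variance components are
    passed in as fixed values (an assumption of this sketch).
    """
    n = y.shape[0]
    v = var_global * k_global + var_local * k_local + var_resid * np.eye(n)
    x = np.column_stack([np.ones(n), marker])
    v_inv_x = np.linalg.solve(v, x)   # V^{-1} X without forming V^{-1}
    v_inv_y = np.linalg.solve(v, y)   # V^{-1} y
    return np.linalg.solve(x.T @ v_inv_x, x.T @ v_inv_y)

# Synthetic data: 50 samples, 200 biallelic markers coded 0/1/2.
rng = np.random.default_rng(0)
genotypes = rng.integers(0, 3, size=(50, 200)).astype(float)

k_global = kinship(genotypes)            # relatedness across all markers
k_local = kinship(genotypes[:, 20:40])   # relatedness within a hypothetical candidate locus

marker = genotypes[:, 25]                # marker under test, inside the locus
y = 1.0 + 0.5 * marker                   # noise-free phenotype, for checking only

beta = gls_marker_effect(y, marker, k_global, k_local,
                         var_global=0.5, var_local=0.3, var_resid=0.2)
```

Because the phenotype here is an exact linear function of the tested marker, the estimate should recover the intercept and slope used to generate it; with noisy phenotypes the two relatedness terms would change how much weight each sample receives.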
Similar resources
On the prospects of whole-genome association mapping in Saccharomyces cerevisiae.
Advances in sequencing technology have enabled whole-genome sequences to be obtained from multiple individuals within species, particularly in model organisms with compact genomes. For example, 36 genome sequences of Saccharomyces cerevisiae are now publicly available, and SNP data are available for even larger collections of strains. One potential use of these resources is mapping the genetic ...
Application of an integrated decision-making approach based on FDAHP and PROMETHEE for selection of optimal coal seam for mechanization; A case study of the Tazareh coal mine complex, Iran
Increasing the production rate and minimizing the related costs, while optimizing safety measures, are today's most important tasks in the mining industry. To these ends, mines can be mechanized, which can result in significant cost reductions and higher levels of profitability for underground mines. The potential for mechanizing a coal mine depends on some important fact...
Multi-ethnic studies in complex traits
The successes of genome-wide association (GWA) studies have mainly come from studies performed in populations of European descent. Since complex traits are characterized by marked genetic heterogeneity, the findings so far may provide an incomplete picture of the genetic architecture of complex traits. However, the recent GWA studies performed on East Asian populations now allow us to globally ...
FUZZY GRAVITATIONAL SEARCH ALGORITHM AN APPROACH FOR DATA MINING
The concept of intelligently controlling the search process of the gravitational search algorithm (GSA) is introduced to develop a novel data mining technique. The proposed method is called fuzzy GSA miner (FGSA-miner). First, a fuzzy controller is designed for adaptively controlling the gravitational coefficient and the number of effective objects, as two important parameters which play major ro...
Understanding the evolution of defense metabolites in Arabidopsis thaliana using genome-wide association mapping.
With the improvement and decline in cost of high-throughput genotyping and phenotyping technologies, genome-wide association (GWA) studies are fast becoming a preferred approach for dissecting complex quantitative traits. Glucosinolate (GSL) secondary metabolites within Arabidopsis spp. can serve as a model system to understand the genomic architecture of quantitative traits. GSLs are key defen...